Clustering and Classification of Large Document Bases in a Parallel Environment

نویسندگان

Anthony S. Ruocco

Ophir Frieder

چکیده

In This Issue Bert R. Boyce IN MEMORIAM Jean Tague-Sutcliffe, 1931-1996 Mike Nelson RESEARCH Design and Implementation of Automatic Indexing for Information Retrieval with Arabic Documents Ismaii Hmeidi, Ghassan Kanaan, and Martha Evens Information Using Likeness Measures Martin FrickP Types and Levels of Collaboration in Interdisciplinary Research in the Sciences Jian Qin, F. W. Lancaster, and Bryce Allen Measuring the Impact of Information on Development: A LISREL-Based Study of Small Businesses in Shanghai Liwen Qiu Vaughan and Jean Tague-Sutcliffe Clustering and Classification of Large Document Bases in a Parailel Environment Anthony S. Ruocco and Ophir Frieder BRIEF COMMUNICATIONS Fractional Counting of Multiauihored Publications: Consequences for the Impact of Authors G. Van Hooydonk 8 6 5

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

Comparing k-means clusters on parallel Persian-English corpus

This paper compares clusters of aligned Persian and English texts obtained from k-means method. Text clustering has many applications in various fields of natural language processing. So far, much English documents clustering research has been accomplished. Now this question arises, are the results of them extendable to other languages? Since the goal of document clustering is grouping of docum...

متن کامل

Learning Document Image Features With SqueezeNet Convolutional Neural Network

The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...

متن کامل

High Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation

Image segmentation is one of the most common steps in digital image processing. The area many image segmentation algorithms (e.g., thresholding, edge detection, and region growing) employed for classifying a digital image into different segments. In this connection, finding a suitable algorithm for medical image segmentation is a challenging task due to mainly the noise, low contrast, and steep...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

JASIS

دوره 48 شماره

صفحات -

تاریخ انتشار 1997

Clustering and Classification of Large Document Bases in a Parallel Environment

نویسندگان

چکیده

منابع مشابه

A Joint Semantic Vector Representation Model for Text Clustering and Classification

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Comparing k-means clusters on parallel Persian-English corpus

Learning Document Image Features With SqueezeNet Convolutional Neural Network

High Performance Implementation of Fuzzy C-Means and Watershed Algorithms for MRI Segmentation

عنوان ژورنال:

اشتراک گذاری